Penalized PCA approaches for B-spline expansions of smooth functional data
نویسندگان
چکیده
Functional principal component analysis (FPCA) is a dimension reduction technique that explains the dependence structure of a functional data set in terms of uncorrelated variables. In many applications the data are a set of smooth functions observed with error. In these cases the principal components are difficult to interpret because the estimated weight functions have a lot of variability and lack of smoothness. The most common way to solve this problem is based on penalizing the roughness of a function by its integrated squared d-order derivative. Two alternative forms of penalized FPCA based on B-spline basis expansions of sample curves and a simpler discrete penalty that measures the roughness of a function by summing squared d-order differences between adjacent B-spline coefficients (P-spline penalty) are proposed in this paper. The main difference between both smoothed FPCA approaches is that the first uses the P-spline penalty in the least squares approximation of the sample curves in terms of a B-spline basis meanwhile the second introduces the P-spline penalty in the orthonormality constraint of the algorithm that computes the principal components. Leave-one-out cross-validation is adapted to select the smoothing parameter for these two smoothed FPCA approaches. A simulation study and an application with chemometric functional data are developed to test the performance of the proposed smoothed approaches and to compare the results with non penalized FPCA and regularized FPCA. Many application areas have functional data that come from the observation of a random function at a discrete set of sampling points. In the majority of cases the sample curves are functions of time but in many others the argument is a different magnitude. In spectroscopy, for example, the NIR spectrum is a functional variable whose observations are measured as functions of wavelengths. The potential of functional data analysis methodologies for the chemometric analysis of spectro-scopic data was shown in [1]. A wide variety of applications with functional data in different fields were collected and analyzed by [2]. When analyzing a functional data set it is usual to have a large number of regularly spaced observations for each sample curve. Because of this a reduction dimension technique is necessary for explaining the main features of a set of sample curves in terms of a small set of uncorrelated variables. This problem was solved by generalizing principal component analysis to the case of a continuous-time stochastic process [3]. Asymptotic properties of the estimators of FPCA …
منابع مشابه
Use of Two Smoothing Parameters in Penalized Spline Estimator for Bi-variate Predictor Non-parametric Regression Model
Penalized spline criteria involve the function of goodness of fit and penalty, which in the penalty function contains smoothing parameters. It serves to control the smoothness of the curve that works simultaneously with point knots and spline degree. The regression function with two predictors in the non-parametric model will have two different non-parametric regression functions. Therefore, we...
متن کاملSimultaneous Confidence Bands for Penalized Spline Estimators
In this paper we construct simultaneous confidence bands for a smooth curve using penalized spline estimators. We consider three types of estimation methods: (i) as a standard (fixed effect) nonparametric model, (ii) using the mixed model framework with the spline coefficients as random effects and (iii) a Bayesian approach. The volume-of-tube formula is applied for the first two methods and co...
متن کاملEstimating penalized spline regressions: Theory and application to economics
In this paper we give a brief survey of penalized spline smoothing. Penalized spline smoothing is a general non-parametric estimation technique which allows to fit smooth but else unspecified functions to empirical data. While penalized spline regressions are quite popular in natural sciences only few applications can be found in economics. We present an example demonstrating how this non-param...
متن کاملEstimating and Interpreting Effects from Nonlinear Exposure-Response Curves in Occupational Cohorts Using Truncated Power Basis Expansions and Penalized Splines
Truncated power basis expansions and penalized spline methods are demonstrated for estimating nonlinear exposure-response relationships in the Cox proportional hazards model. R code is provided for fitting models to get point and interval estimates. The method is illustrated using a simulated data set under a known exposure-response relationship and in a data application examining risk of carpa...
متن کاملPenalized spline models for functional principal component analysis
We propose an iterative estimation procedure for performing functional principal component analysis. The procedure aims at functional or longitudinal data where the repeated measurements from the same subject are correlated. An increasingly popular smoothing approach, penalized spline regression, is used to represent the mean function. This allows straightforward incorporation of covariates and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Applied Mathematics and Computation
دوره 219 شماره
صفحات -
تاریخ انتشار 2013